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psj , Abstract 

We present an all-pairs shortest path algorithm whose running time on a complete directed graph on n 
^ ' vertices whose edge weights are chosen independently and uniformly at random from [0, 1] is 0{'n?), in 

expectation and with high probability. This resolves a long standing open problem. The algorithm is 
a variant of the dynamic all-pairs shortest paths algorithm of Demetrescu and Italiano. The analysis 
relies on a proof that the number of locally shortest paths in such randomly weighted graphs is O(n^), 
in expectation and with high probability. We also present a dynamic version of the algorithm that 
recomputes all shortest paths after a random edge update in 0(log^ n) expected time. 
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1 Introduction 

^ The All-Pairs Shortest Paths (APSP) problem is one of the most important, and most studied, algorithmic 
'—'I graph problems. Given a weighted directed graph G = (y, c), on \V\ = n vertices and |£'| = m edges, 
' where c : E ^ is a length (or cost) function defined on its edges, we would like to compute the distances 

J> I between all pairs of vertices in the graph and a succinct representation of all shortest paths. (The length 
O of a path is the sum of the lengths of the edges participatmg m the path.) 

t — , The APSP problem can be solved in 0{mn + n logn) worst-case time by running Dijkstra's algorithm 
" from each vertex of the graph. (See Dijkstra |Tl], Fredman and Tarjan |13J.) A slightly better running 
• time of 0{mn-\-n'^ log logn) was obtained by Pettie [29], building on techniques developed by Thorup [32]. 
\ Karger, Roller and Phillips |21j and McGeoch |23j developed algorithms that run in 0{m*n + n^logn) 

T— I ' time, where m* is the number of edges in the graph that are shortest paths. 

J> ■ Demetrescu and Italiano f7^,Wi (see also Thorup [33]) obtained a dynamic APSP algorithm with an amortized 
^ ■ vertex update time of 0{n'^). Thorup [34j obtained a dynamic algorithm with an 0{n^ '^^) worst-case vertex 
$^ . update time. A vertex update may insert, delete and change the weight of edges that touch a given vertex v. 
An edge update may only insert, delete or change the weight of a single edge. The algorithms of Demetrescu 
and Italiano [TJ |8] and Thorup [33l |3l] can be used, of course, to perform edge updates, but the updates 
times may still be O(n^) and 0(n^'^^), respectively. 

Many researchers developed APSP algorithms that work well on random instances, most notably complete 
directed graphs on n vertices with random weights on their edges. The simplest such model, on which 
we focus in this paper, is the one in which all edge weights are drawn independently at random from the 
uniform distribution on [0,1]. Hassin and Zemel |19j and Frieze and Grimmett jl6j observed that, with 
very high probability, only the O(logn) cheapest edges emanating from each vertex participate in shortest 
paths. Thus, the APSP in this setting can be solved in O(n'^logn) expected time using the algorithms of 
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Karger et al. [21] and McGeoch [23], or by simply selecting the O(logn) cheapest edges emanating from 
each vertex and then running Dijkstra's algorithm from each vertex. All these results actually hold in the 
more general setting in which edge weights are independent identically distributed random variables with a 
common cumulative distribution function F that satisfies F{0) = and -F'(O) exists and is strictly positive. 
(The uniform distribution on [0, 1] with F(x) = x, and the exponential distribution, with F{x) = 1 — e~^ 
clearly satisfy these conditions.) Furthermore, the running time of these algorithms is 0{n? log n) with 
high probability, i.e., probability that tends to 1 as n tends to infinity, and not just in expectation. 

Spira |31j obtained an APSP algorithm with an expected running time of 0(n^ log^ n) for complete directed 
graphs with edge weights drawn in an endpoint independent manner. More specifically, for each vertex v 
a sequence of n positive numbers is chosen by an arbitrary deterministic or probabilistic process. These n 
numbers are then assigned to the n edges emanating from f in a random order, with all n! possible 
ordering being equally likely. Bloniarz [2] presented an improved algorithm with an expected running 
time of 0(n^ log n log* n). Moffat and Takaoka [27] and Mehlhorn and Priebe [M] improved the expected 
running time to 0(re^ logn) and showed that it also holds with high probability. 

Cooper et al. [1] obtained an APSP algorithm with an expected running time of O(n^logn) in the vertex 
potential model in which edge weights may be both positive and negative. 

Meyer [25], Hagerup [H] and Goldberg [17] obtained Single-Source Shortest Paths (SSSP) algorithms with 
an expected running time of 0{m). The m-edge input graph may be arbitrary but its edge weights are 
assumed to be chosen at random from a common non-negative probability distribution. When the edge 
weights are independent, the running time of these algorithms is 0{m) with high probability. 

Friedrich and Hebbinghaus \\^ presented an average case analysis of the dynamic APSP algorithm of 
Demetrescu and Italiano [71 [8] on random undirected graphs. The graphs in their analysis are chosen 
according to the G{n,p) model, in which each edge of the complete graph is selected with probability p, 
and edges of the random graph are given i.i.d. uniform random weights. They show that the expected 
edge update time is at most 0{n'^^^~^^), for any e > 0. This bound is essentially tight when p = 1/n, i.e., 
at the phase transition of the random graph, when the largest component is, with high probability, of size 
0(n^/^). When p > {1 + e')/n, they show that the expected update time is 0{n^/p), for every e > 0. 

Non-algorithmic aspects of distances and shortest paths in randomly weighted graphs were also a subject 
of intensive research in probability theory. We mention here only the results that are most relevant 
for us. Davis and Prieditis [5] and Janson [20J showed that the expected distance of two vertices in a 
complete graph with random edge weights drawn independently from an exponential distribution with 
mean 1 (i.e., F{x) = 1 — e~^) is exactly Hn-i/{n — 1) = (In n)/n -|- 0(l/n), where Hj. = Yl\=i \ is the /c-th 
Harmonic number. The probability that a given edge is a shortest path between its endpoints is also exactly 
Hn-i/ {n — \). Exponential random variables are convenient to work with due to their memoryless property. 
The same asymptotic results hold when the edges weights are chosen independently and uniformly from 
[0, 1]. In the exponential case, the tree of shortest paths from a given vertex has the same distribution as 
a random recursive tree on n vertices obtained using the following simple process: Start with a root; add 
the remaining n — 1 vertices, each time choosing the parent of the new vertex uniformly at random among 
the vertices that are already in the tree. The expected depth of a vertex in such a tree, and hence the 
expected number of edges in a shortest path, is Inn -|- 0(1). For further results regarding recursive trees 
and shortest paths, see Devroye [9], Smythe and Mahmoud [30] and Addario-Berry et al. [Ij. 

In their survey on the algorithmic theory of random graphs. Frieze and McDiarmid |15] state the following 
open problem (Research Problem 22 on p. 28): "Find a o(n^ log n) expected time algorithm for the all pairs 
problem under a natural class of distributions, e.g., i.i.d. uniform on [0, 1]." We solve this open problem by 
giving an O(n^) expected time algorithm for the problem, which is of course best possible. Furthermore, 
our algorithm runs in O(n^) time with high probability and works for both directed and undirected versions 
of the all-pairs shortest paths problem. 

Our 0(n^)-time APSP algorithm is a static version of the dynamic APSP algorithm of Demetrescu and 
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Italiano [71, 18] (see especially Section 3.4 of [8]) with some modified data structures. The novel part of this 
paper is not the algorithm itself, but rather the probabilistic analysis that shows that it runs in 0(n?) 
time, in expectation and with high probability. 

We also obtain an 0(log^ n) upper bound on the expected time needed to update all shortest paths following 
a random edge update, i.e., an update in which a random edge of the complete directed graph is selected 
and given a new random edge weight drawn uniformly at random from [0, 1]. 

The rest of the paper is organized as follows. In Section [2] we sketch the static and dynamic versions of the 
algorithm of Demetrescu and Italiano [71|8] used in this paper. (Complete descriptions of these algorithms 
are given in Appendices |A] and 0) The crucial factor that determines the running time of these algorithms 
is the number of locally shortest paths in the graph. A path is a locally shortest path (LSP) if the paths 
obtained by deleting its first and last edge, respectively, are shortest paths. In Section [3] we collect some 
known and some new results regarding the distances between vertices in randomly weighted graphs. Using 
the results of Section [3l we show in Section [J] that the expected number of LSPs in a complete directed 
graph with independent uniformly distributed random weights is 0{v?). In Section [5] we show that the 
number of LSPs is 0{n'^) with high probability. Sections H] and [5] are the main sections of the paper. In 
Section [6] we show that a fairly simple bucket based priority queue, with a constant amortized update time, 
in conjunction with the fact that the number of LSPs is O(n^), in expectation and with high probability, 
yields the promised 0(n^)-time APSP algorithm. In Section [7] we consider the expected time needed to 
perform random edge updates. Interestingly, the arguments used in Sections [5] and [7] are related, as they 
both focus on the expected number of shortest paths that change when a single edge is given a new random 
edge weight. (The link is the Efron-Stein inequality used in Section[5j) In Section[8]we very briefly consider 
other random graph models. In particular, our algorithm still runs in 0{n'^) expected time in the directed 
G{n,p) model, in which each edge is present with probability p, with independent uniformly distributed 
edge weights, at least when p ^ (In n)/n. We end in Section [8] with some concluding remarks and open 
problems. 

2 The algorithm of Demetrescu and ItaUano 

Our O(n^) time bound, in expectation and with high probability, on the complexity of the solving the 
APSP problem on complete directed graphs with independent edge weight drawn uniformly from [0,1], 
and the O(log^n) expected time bound on the complexity of performing a random edge update are both 
obtained using variants of the dynamic APSP algorithm of Demetrescu and Italiano [7, 8j. 

As our main result is the analysis of these variants, and not the variants themselves, we begin by sketching 
the main features of the variants we use, mentioning only what the reader needs to know to understand 
our analysis. A complete description of the algorithms is given in Appendices |A] and [Bj (We believe that 
our variants are also of some interest, as they are not identical to the algorithms of [Zl[8].) 

Let G = {V, E, c) be a weighted directed graph, where c : E ^ {0, oo) is a cost function defined on its 
edges. (We use weights and costs interchangingly.) For simplicity, we assume that all shortest paths in G 
are unique. Under essentially all probabilistic models considered in this paper, this assumption holds with 
probability 1. (Non- uniqueness of shortest paths can be dealt with as in [7].) We let u — )■ v denote the 
edge {u,v) € E, and let u v denote the (unique) shortest path from n to u in the graph, if they exist. 

The key notion behind the algorithm of Demetrescu and Italiano ^ is the notion of locally shortest paths. 

Definition 2.1 (Locally Shortest Paths) A path is a locally shortest path (or LSP, for short) if the 
path obtained by deleting its first edge, and the path obtained by deleting its last edge, are both shortest 
paths. 
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More formally, if we let u ^ u' v' ^ v denote the path composed of the edge u ^ u' , followed by the 
shortest path from u' to v' , and then by the edge v' ^ v, then u ^ u' v' ^ v is a locally shortest path 
if and only ii u ^ u' v' and u' v' v are both shortest paths. (If u' = v' , then u' v' is an empty 
path.) An edge is considered to be a locally shortest path. (Empty paths are considered to be shortest 
paths.) A shortest path is of course also a locally shortest path. A locally shortest path, however, is not 
necessarily a shortest path. 

2.1 A static version 

We begin by describing a static version of the algorithm of Demetrescu and Italiano [3 [8]. Let G = {V, E, c) 
be a weighted directed graph. The algorithm constructs all shortest paths in G by essentially running 
Dijkstra's algorithm in parallel from all vertices, while only examining LSPs, as explained below. 

For every u,v V, the algorithm maintains a number dist[u,v] which is the length of the shortest path 
from u to V found so far. Initially dist[u,v] is set to c{u,v), if {u,v) G E, or to oo, otherwise. Each pair 
{u,v) & E is inserted into a heap (priority queue) Q, with dist[u,v] serving as its key. The heap Q holds 
all pairs of vertices (n, v) such that at least one path from u to f in the graph was already discovered, but 
the shortest path from u to w was not yet declared. 

In each iteration, the algorithm extracts a pair {u, v) with the smallest key in Q. As in Dijkstra's algorithm, 
dist[u, v] is then the distance from it to v in G. The algorithm then examines LSPs that extend the shortest 
path u V and checks whether they are shorter than the currently best available paths between their 
endpoints. (An extension of a path vr is a path obtained by adding an edge to its beginning or end.) To 
efficiently find the LSPs that extend a shortest path u v, the algorithm also maintains, in addition to 
dist[u,v], the following information for every u,v €V: 

p[u, v] - The second vertex on the shortest path from u to f found so far. 

" The penultimate (next to last) vertex on the shortest path from u to v found so far. 
L[u, v] - A list of vertices w for which w ^ u v is known to be a shortest path. 
R[u, v] ~ A list of vertices w for which u v w is known to be a shortest path. 

If no path from u to v was found yet, then = ^[it, = null. The lists L[u,v] and R[u,v] specify 

the left and right extensions of u v that are known to be shortest paths. Clearly L[ti, and R[u,v] are 
non-empty only after the shortest path u v was identified by the algorithm. 

Suppose that n — > n' f' — )• f , where u' = p[u,v] and v' = q[u,v], was just identified as a shortest path. 
For every w £ L[u,v'], w ^ u v is an LSP. Similarly, for every w € R[u',v], u v ^ w is an LSP. 
These paths are now examined by the algorithm. If, for example, a path w u v is found to be shorter 
then the currently available path from w to v, or is the first path found from w to v, then distlw, v], p[w, v] 
and are updated accordingly and the key of (w,f) in Q is decreased. (If {w,v) is not already in Q, 

it is inserted into Q.) 

This is the gist of the static version of the algorithm of Demetrescu and Italiano [T', '8], which, for concrete- 
ness, we refer to as algorithm apsp. For a complete description and pseudo-code, see Appendix [Al 

As algorithm apsp uses a priority queue, its running time depends on the characteristics of the priority 
queue used. For a specific implementation, we let Tins{n) ^Tdec{n) and Text{n) denote the (amortized) times 
of inserting an element, decreasing the key of a given element, and extracting an element of minimum key 
from a priority queue containing at most n elements. We next claim: 

Theorem 2.2 // all edge weights are positive and all shortest paths are unique, then algorithm apsp 
correctly finds all the shortest paths in the graph. Algorithm apsp runs in 0{n'^ ■ {Tins{n'^) + Te^t(n^)) + 
l^cSPl • Tdec('T'^)) time, where \CSV\ is the number of LSPs in the graph, and uses only 0{n^) space. 
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The proof of Theorem 12.21 which is essentially identical to the correctness proof given by Demetrescu and 
Italiano [3 [8], can be found in Appendix [Xl 

If we use the Fibonacci heaps data structure (Fredman and Tarjan [13]) that supports extract-min opera- 
tions in O(logn) amortized time, and all other operations in 0(1), amortized time, where n is the number 
of elements in the heap, we get a running time of 0{'n? logn + 1^257^1). There are, thus, two hurdles on 
our way to getting an expected 0(n^)-time algorithm. First, we have to show that \CSV\ is O(n^), under 
natural probability distributions, in expectation and with high probability. We do that in Sections S] and [5j 
Second, we have to find a faster way of implementing heaps. We do that in Section [6] using a bucket based 
implementation. 

2.2 A dynamic version 

The static algorithm of the previous section examines all locally shortest paths in a graph, but (implicitly) 
maintains only those that are currently shortest. The dynamic algorithm, on the other hand, explicitly 
maintains all locally shortest paths, even if they are already known not to be shortest paths. 

For every path tt, we let /[vr] be the path obtained by deleting the last edge of n, and r[7r] be the path 
obtained by deleting the first edge of vr. A path vr is represented by keeping its total cost, its first and last 
edges, and pointers to its subpaths /[vr] and r[7r]. The collection of all paths maintained by the algorithm 
is referred to as the path system. 

For every pair of vertices u,v &V, the dynamic algorithm maintains a heap P[u, v] that holds all the LSPs 
connecting u and v found so far. The key of each path is its cost. As in the static case, dist[u,v] is the 
cost of the shortest path 7r[u, v] from n to w found so far. 

For every LSP vr, the dynamic algorithm maintains four lists of left and right extensions of vr. The lists 
5-L[vr] and S'i?[vr] contain left and right extensions of vr that are known to be shortest paths. The lists -L[vr] 
and -R[vr] contain extensions of vr that are known to be LSPs. 

Let E' be a set of edges whose costs are changed by an update operation. (We are mostly be interested 
in the case in which E' is composed of a single edge, but the description below is general.) The dynamic 
algorithm recomputes all shortest paths as follows. First all LSPs containing edges of E' are removed from 
the path system. (Note that each edge of E' is an LSP, and is thus contained in the path system. All LSPs 
containing edges of E' can be found by recursively following the extension lists of these edges.) 

For every pair of vertices u,v £ V such that the shortest path from u to v before the update passes 
through an edge of E', and was therefore removed from the path system, the algorithm finds the cheapest 
path in P[u,v], if at least one such path remains, and assigns it to vr[ii, w]. It then inserts the pair {u,v) 
into a global heap Q. The key of {u,v) in Q is the cost of vr[?x, f]. Next, it recreates single-edge paths 
corresponding to the edges of E' , with their new edge weights, and examines them. 

The dynamic algorithm now starts to construct new shortest paths. In each iteration it extracts from Q 
a pair {u,v) with the smallest key. As in the static case, the path vr[ii, is then a shortest path from u 
to V. LSP extensions of vr[u, f], obtained by combining vr[n, u] with paths that are already known to be 
shortest paths, are now generated. If such an extension is shorter than the currently shortest available 
path containing its endpoints u' and v' , then tt[u',v'] and dist[u',v'] are updated accordingly, and {u',v') 
is inserted into Q with the appropriate key. (If {u',v') is already in Q, its key is decreased.) 

An important difference between the dynamic variant used in this paper and the dynamic algorithm of 
Demetrescu and Italiano [3 [8] is that when a path vr stops being a shortest path, it, and all its extensions, 
are immediately removed from the path system. A similar dynamic variant was used by Friedrich and 
Hebbinghaus [H]. The algorithm of Demetrescu and Italiano [3 [8] keeps such paths as historical and 
locally historical paths. (See also Demetrescu et al. [6].) 
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The most impressive feature of the dynamic algorithm of Demetrescu and Itahano [71 fS] is that its update 
time is proportional to the number of shortest and locally shortest paths that are destroyed and/or created 
by the update operation. The algorithm does not spend time on shortest paths that remain unchanged. 

Let SV and CSV^ be the sets of shortest and locally shortest paths destroyed by an update operation. 
Similarly, let SV^ and CSV^ be the sets of shortest and locally shortest paths that are created (or 
recreated) by an update operation. Note that SV~ and SV^ , and CSV~ and CSV^ , are not necessarily 
disjoint, as paths passing through edges of E' are first destroyed, and removed from the path system, but 
may then be recreated. Let A be an upper bound on the number of LSPs that connect any given pair of 
vertices before and after the update. 

A complete description of the dynamic variant sketched here and its correctness proof are given in Ap- 
pendix |Bl where the following theorem is proved. (update(£", c') is the function that updates all shortest 
paths following a change in the costs of the edges of £".) 

Theorem 2.3 The running time o/ update (E'', c') is 

O (1 I • (T^ei ( A) + T„i„ ( A) + ri„, ) ) + 1 I • Te^t ) + I I • Trfe/ ( A) + I 1 • {Tins ( A) + T^ec (r^' ) ) • 

Here, Tins[n) ,Tdei{n) ^Tdecin) ,Text{n) and Tmin{n) are the (amortized) times of inserting, deleting, decreas- 
ing the key, extracting the element of minimum key, and finding the element of minimum key of a priority 
key containing at most n elements. 

We show in Section [7] that for a random edge update we have E[ | ], E[ |5'P'*'| ] = O(logn) and that 
E[|£5P"|],E[|£5P+|] = O(log^n). We also show that A = O(logn), with high probability. Using 
appropriate implementations of the priority queues, we get an expected edge update time of O(log^n). 

3 Distances in complete randomly weighted graphs 

Let Kn = {V-, E) be a complete directed graph on n vertices and let a,b £ V. We let W{a, b) be the random 
weight attached to the edge (a, b). We assume at first that W{a, b) is an exponential random variable with 
mean 1, i.e., W{a,b) ~ EXP(1). Due to the memoryless property, dealing with exponentially distributed 
edge weights is easier than dealing directly with uniformly distributed edge weights. We later explain why 
all the results derived in this section for exponential edge weights also hold, asymptotically, for uniformly 
distributed edge weights. All n{n — 1) random edge weights are assumed to be independent. (Self-loops, 
if present, may be ignored.) Let D{a,b) be the distance from a to 6 in the graph, i.e., the length (sum of 
weights) on the shortest path a 6 in the graph. (The shortest path a 6 is unique with probability 1.) 
Note that D(a, b) is now also a random variable. For k G {1, 2, . . . , n — 1}, we let Dk{a) be the distance 
from a to the fc-th closest vertex to a. 

Let Hk = X]fc=i \ be the fe-th Harmonic number. It is known that Hn = Inn + 7 + O(^), where 7 = 
0.57721 ... is Euler's constant. 

The following five lemmas can be found in Janson [20]. (The expectation of D{a, b), but not the variance, 
can also be found in Davis and Prieditis [5]). The lemmas in [20] are stated for undirected graphs, but it 
is easy to check that they also hold for directed graphs. 

Lemma 3.1 Let a and k S {1, 2, . . . , n — 1}. Then, 

Dk{a) = V — '—r , 
i=i ^ ' 

where Xi,X2, ■ ■ ■ ,Xk are i.i.d. exponential variables with mean 1. 
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Lemma 3.2 Let a^^b eV. Then, 



D{a,b) = DL{a) = ^ 



^ i{n — i) ' 

where Xi,X2, ■ ■ ■ ,Xn-i are i.i.d. exponential variables with mean 1, and L is chosen uniformly at random 
from {1, 2, . . . , n — 1}. 



Lemma 3.3 Let a ^ b ^ V . Then, 

E[D(a,6)] = ^ = i^ + 0(i) , Yar[D{a,b)] = ^ + oi^). 

Lemma 3.4 For any constant c> 3, we have 



n 



c In n 

maxiJio, o) > 

a,b n 



Lemma 3.5 Let a ^ b G V . Then, the probability that the edge a ^ b is a shortest path is — 
^ +0(i). 

The next two lemmas are new and might be interesting in their own right. They are used in Section [5] to 
show that the running time of algorithm apsp is 0{n'^) with high probability. The proof that the expected 
running time of apsp is 0{n'^), given in Section HI does not rely on them. 



Lemma 3.6 Leta^^beV. If n'" <a< 1/2, then F[D{a,b) > (1 + 12a)^] < Sn"". 

Proof: Let Sk,e = Yli=k i(n-i) • allow k,i to be non-integral, in which case, we have Sk,e = 5'[fc],[£j-) 
By Lemma [3.21 we get that D(a,b) = Si^l, where L is uniformly distributed in {l,2,...,n — 1}. Let 
m = n^~". We clearly have 



We now decompose 



Now 



F[D{a,b) > Si^n-m] = P[L>n-m] < m/n = n"". 



-^1 

S\ n—m, ^ 7 + m + 'S'm,n/2 "l~ Sn/2,n- 

n — 1 ' ' 



Xi In n 
— > 2a 

n — 1 n 



< F[Xi > alnn] = n"°. 



Let Y = Using our assumption that n " < a we get that 



S2,m < 



Y 



Y 



Y Y 

= 7 ^ < 7 ^ < (l + 2a) — . 

n — m (1 — n ")n (1 — a)n n 



Now E[e 



AX, 



(1 - A)-\ for A < 1, so 



E [( 



E 



i=2 i=2 ^ ^ i=2 



-1 m 



m. 



Therefore, 



^2,™-! > (1 + 2a) 



Inn 



n 



< F[Y > Inn] = P[e'*^ > n] < 



E[e^ 



n 



m 



n 



n 



(1) 
(2) 
(3) 



(4) 
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Let Z = YJi!^^ ^- Clearly, 5^,„/2 < f • We again have 



E[e^ 



n/2 



< 



n 



m 



n . 



Therefore, 



Inn 



n 



< F[Z > 2a Inn] 



> n^-1 < 



n 



2a 



n 



As s-iid Sn/2,n-m have exactly the same distribution, we also get that 

In n 



Sn/2,n-m ^ 4q!- 



n 



< n 



Using (l)-(6) together, we get that P [D{a,b) > (1 + 12a)^] < Sn"", we required. 



(5) 

(6) 
□ 



No attempt to optimize the constants appearing the statement of Lemma 13.61 The condition n^" < a in 
the lemma is satisfied for any fixed a > 0, when n is large enough. It also holds when, say, a = a(n) = 
(In Inn)/ Inn. 

The proof of our next lemma relies on the following large deviation theorem of Maurer |22] . 



Theorem 3.7 (Maurer [22| ) LetYi,Y2, . . . ,Yn be non-negative independent random variables with finite 
first and second moments and let S = Y17=i t > 0. Then 

+2 



IE[5] -S>t 



< exp 



For a vertex a & V and r > 0, let Ball{a, r) = {b £ V \ D(a, b) < r} be the ball of radius r centered at a. 
We next bound the probability that Ball{a,a^^) is exceptionally large. 



Lemma 3.8 For any a V , a < 1 and c> we have 



Ball I a, a 



n 



> cn 



< exp 



In^ c 



Proof: Note that \Ball{a,r)\ > /c if and only if Dk{a) < r. By Lemma 13.11 we have Dk = Dk{a) 
, where Xi,X2, ■ ■ ■ are i.i.d. exponential variables with mean 1. Thus, 



k k 
1 \ ^ 1 1 \ - 1 Ink 

= Y.-! — ^ > -E-> — • 

^-^ i[n — i) n ^-^ I n 

1=1 



As E[X2] = 2, we have 



i=l 



X,, 



^ n — I 



< 



i'^in — i)"^ ^-^ i?{n — i) 



n/2 



< 



^ i'^{n — iy 



^ 16 1 ^ IGvr^ ^ 30 

^2 9'2 Q 77,2 



n^ z — / 



With A; = cn" we get that E[Z)fc] > + ^^^^ and by Theorem E21 with Yi 



X,, 



a In n 




Dk < 


< P 


n 





nDk] -Dk> 



Inc 



n 



< exp 



60 



i(n—i) 

exp 



, we have 



In^c 



60 



□ 



As an immediate corollary, we get: 
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Corollary 3.9 For any aE^, a<l, e>0 and c > we have 

Inn' 



Ball a, a 



n 



> n 



a+e 



0{n- 



The results of this section were derived under the assumption that the edge weights are exponential. How- 
ever, as explained in detail in the beginning of Section 2 of Janson [20J, the same results hold asymptotically 
also for the uniform distribution. For the sake of completeness we show how to deduce from Lemma 13.61 
and Corollarv 13.91 similar claims for uniform distributions. 

Let G be a complete directed graph on n vertices with independent uniformly distributed edge weights 
W{a,b) and let D(a,b) be the distance from a to & in this graph. Define W'{a,b) = — ln(l — W{a,b)) 
and let G' be a complete directed graph whose edge weights are W'{a,b). Denote by D'{a,b) the distance 
from a to 6 in G'. Note that all the edges of G' have weights distributed as independent exponential 
random variables with mean 1 and that the correspondence between G and G' is a measure preserving 
transformation. It is easy to check that z < — ln(l — z) < z + 2z^ for all < z < 1/2. 

Suppose that G has the property that D{a,b) > (1 + 12a)^. Since D'{a,b) > D{a,b), each such G 
corresponds to a graph G' which also has D'{a, 6) > (1 + 12a)^. Therefore by Lemma [3T6] the probability 
of this event is at most 5n~". Suppose that 6 is a vertex of G satisfying D{a,b) < Then, by the 

above inequality, we have that D'{a, b) < a^-^ + 2a^^^ = a'^ with a' = (l + 0(^))a. For any e > 0, 
let e' = e/2. Then it is easy to check that n"'^'^' < n"^'^. Therefore all G in which \Ball (a, a^^) | > n°'~^'' 
correspond to instances of G' in which \Ball (a,a'^^) | > . By Corollary 13.91 the probability of this 

event is at most 0{n~^) for any c > 0. 



4 The expected number of locally shortest paths 

Let CSV be the set of LSPs in Kn- Our goal in this section is to show that E[|£5'P|] = O(n^). This would 
follow immediately from the following lemmas. 

Lemma 4.1 Let a,b,c be three distinct vertices. The probability that a ^ b ^ c is an LSP is 0(^^^). 

Proof: The path o — >■ 6 — >■ c is an LSP if and only if both a ^ b and b ^ c are shortest paths. By 
Lemma 13.51 the probability that each one of the edges a ^ b and 5 ^ c is a shortest path is + 0(i). 
Unfortunately, the events "a — > 6 is a shortest path" and "6 — > c is a shortest path" are not independent 
(and probably positively correlated). To circumvent that, let Vl, V2 C V such that V1UV2 = V, a G Vi, 
c G V2, Vi n V2 = {6} and \Vi\, IV2I > 'n/2 be a fixed partition of the vertex set V. If a ^ 6 and b ^ c are 
shortest paths in G, then a — > 6 is clearly also a shortest path in G[Vi], the subgraph of G induced by Vi, 
and 6 — )• c is also a shortest path in G[V2]. These events are now independent, as the edge sets of G[Vi] 
and G[V2] are disjoint. The probability that a ^ 6 — )• c is an LSP is thus at most (^^W?) + 0(i))2. □ 

Lemma 4.2 Let a, b, c, d be four distinct vertices. The probability that a^b~^c^disan LSPisO{^). 

Proof: If a — )• 6 ~^ c ^ d is an LSP, then by definition 

W{a,b) + D{b,c) = D{a,c) , D{b,c) + W{c,d) = D{b,d). 

If a —7- 6 ~^ c —7- d is an LSP, then b c does not pass through a or d. (If, for example, b c passes 
through a, then a c is a subpath of 6 c, and a — >■ 6 c is therefore not a shortest path, contradicting 
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the assumption that a^b-^c^d is an LSP.) Thus, D{b, c) = Da^d{b, c), where Da^dib, c) is the distance 
from 6 to c when a and d are removed from the graph. We also clearly have D{a,c) < Db^d{cL,c) and 
D{b,d) <Da,c{b,d). 

Thus, if a — >• 6 ~^ c — > (i is an LSP, then 

W{a, b) + Da^ib, c) < Dh,d{a, c) , Da,d{b, c) + W{c, d) < Da,c{b, d), 

or equivalently 

W{a, b) < Dh,d{a, c) - c) , Ty(c, d) < Da^b, d) - Da,d{b, c). (*) 

It is thus sufficient to bound the probability that (*) happens. For brevity, let 

X = Da4{b,c) , Y = Di,4{a,c) , Z = Da,c{b,d). 

A crucial observation now is that X,Y and Z do not depend on W{a,b) and W{c,d). This follows from 
the fact that in each one of these distances one of a and 6, and one of c and d, is removed from the graph. 

We can thus choose the random weights of the edges in two stages. First we choose the random weights of 
all edges except the two edges a ^ b and c ^ d. The values of X, Y and Z are then already determined. 
We then choose W{a,b) and W{c,d), the random weights of the two remaining edges. As the choice of 
W{a,b) and W{c,d) is independent of all previous choices, and as W{a,b) and W{c,d) are independent 
and uniformly distributed in [0, 1], we get that 

P[(*)] = E[{Y - X)+ ■ {Z - X)+] < E[\Y - X\\Z - X\], 

where x"*" = max{x,0}. (Note that we are not assuming here that X,Y and Z are independent. They are 
in fact dependent.) 

We next note that each of X, Y and Z is the distance between two given vertices in a randomly weighted 
complete graph on n — 2 vertices. Thus, E,[X] = E\Y] = E[Z]. By Lemma 13.31 we have 



Now, 



Var[X] = Var[y] = Var[Z] = (1 + o(l))^. 

2n^ 



0] < E[\Y - X\\Z - X\] < ^(E[{Y - Xf] + E[{Z - Xf] 



using the trivial inequality xy < ^(x^ + y^). All that remains, therefore, is to bound — ^)^] and 

E[{Z - Xf]. Let n = E[X] = E[Y]. Then, 

E[{Y-Xy] = E[{{Y - - {X - 

< 2{E[iY - t,)'] + E[iX - ^f]) 

= 2(Var[y] + Var[X]) 

= (i + o(i))4, 



using the inequality (x — y)'^ < 2{x^ + y^). Exactly the same bound applies to E\{Z — X)'^]. Putting 

2 

everything together, we get that P[(*)] < (1 + o(l))^. □ 



Theorem 4.3 E[\CSV\] = Q{n'^) 
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Proof: The number of LSPs of length 1 is exactly n(n — 1). (Every edge is an LSP of length 1.) By 
Lemma l4.ll the expected number of LSPs of length 2 is 0{n^ ■ ^^t^) = O(nln^n). By Lemma 14.21 the 
expected number of LSPs of length greater than two is 0(n^ ■ = 0{v?). □ 

Experiments that we have done seem to suggest that £[1/^5^1] is very close to + l)n^ ~ 2.64n^. 

The results of this section were stated and proved for directed graphs. It is easy to check, however, that 
our methods can be also used to provide an all pairs shortest paths algorithm with a quadratic running 
time for the complete undirected graphs on n vertices with uniform edge weights. 

5 High probability bound on the number of locally shortest paths 

Our goal in this section is to show that the number of LSPs is 0(v?) asymptotically almost surely (a.a.s), 
i.e., that there exists a constant c such that P[|/35'P| < cn^] — )■ 1, as n — >■ oo. 

Let E* be the set of edges that are shortest paths. Let A be the maximum outdegree in the subgraph 
G* = {V,E*). (McGeoch [23] refers to G* = {V,E*) as the essential subgraph.) We first show that 
A = O(logn), with very high probability. 

Lemma 5.1 For every c> 6, we have P[A > clnn] = 0(n^~''^/^). 

Proof: Let G' = {V,E') be the subgraph of G composed of all edges of weight at most §^^, and 
let A' be the maximum outdegree in G' . The outdegree of each vertex in G' is binomially distributed 
with parameters n and f A special case of Chernoff bound (see, e.g., [26], p. 64) states that if X is 
a binomial variable with fj, = IE[X], then ¥[X > 2fi] < e~^^^. Thus, the probability that the degree of 
a given vertex exceeds clnn is at most n""^/^. Thus P[A' > clnn] < n^~^^^. Now, A > A' only if at 
least one distance in G is greater than By Lemma 13.41 the probability that this happens is at most 

Q^^3-c/2 ^Qg2 c > 6 we have 1 - c/6 > 3 - c/2. □ 

The following lemma is trivial and can also be found in Demetrescu and Italiano [Tj. 
Lemma 5.2 If all shortest paths are unique, then \CSV\ < An^. 

Proof: Every LSP is obtained by appending an edge which is itself a shortest path to some shortest path. 
The number of shortest path in a graph is at most n^ (assuming uniqueness) and each one of these shortest 
path can be extended by at most A edges. □ 

Note that Lemmas 15.11 and 15.21 implv that the number of LSPs is 0(n^ log n) with high probability. To 
improve this bound to O(n^) we need to work harder. 

Definition 5.3 (/3-short paths) Let f3 > be a (small) constant. We say that a shortest path vr is f3- 
short if and only if its length is at most (1 + 13)^^, and /3-long, otherwise. Similarly, we say that an LSP tt 
is /3-short if both shortest paths Zfvr] and r[7r] obtained by removing its first edge and last edge are short, 
and ;5-long, otherwise. Let SV^ , SV^ , CSV^ , CSV^ he the sets of (3 -short and P-long shortest and locally 
shortest paths. (Note that these sets depend on the parameter (3.) 

Clearly, I^CiS^I = \CSV^ \ + \CS'P^\. We estimate separately the number of /3-long LSPs and the number 
of /3-short LSPs. We begin by bounding the number of /3-long shortest paths and locally shortest paths. 

Lemma 5.4 For every /3 > 0, we have ¥.[\SV^\] = 0{n^'l^/^'^). 
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Proof: By Lemma 13.61 with a = /3/12, we get that for any a 7^ 6 € ^ we have 



P[D(a,6) > + 1 = 0{n-^l^'^). 

n 

The lemma follows by the linearity of expectation. □ 
Lemma 5.5 For every ^ > 0, we have EUCSV^l] = 0{n'^->^/'^'^lnn). 

Proof: Using the same argument used in the proof of Lemma 15.21 we get that \CSV^\ < AlSV^l- By 
Lemma l5. II we get 

E[\CSV^\] < E[A\SV^\] < clnn-E[\SV^\]+n^-''/^n^ . 
Letting c = 12 and using Lemma 15.41 we get that E[\£SV^\] = ©(n^-^/^^in n), as required. □ 

Lemma 5.6 For every f3 > we have F[\£.SV^\ > n^] = ©(n^^/^^ Inn). 

Proof: Follows from Lemma [531 using Markov's inequality. □ 

We next show that ICSV^l = 0{n^) with high probability. To do that we use the Efron- Stein inequality 
(see, e.g., Boucheron et al. [3j) to bound Var[|/35P'^|]. 

Theorem 5.7 (Efron-Stein inequality) Let Z = f{Xi,...,Xm), where Xi, X2, ■ ■ ■ , Xm are indepen- 
dent random variables. For any 1 < i < m, let X'- be a random variable with the same distribution as Xi 
but independent from Xi, X2, ■ ■ ■ , Xm, and let Z'- = f{Xi, . . . , X'-, . . . , Xm)- Then, 

^ m 

Var[Z] < -Y^E[{Z-Zlf . 

1=1 

In our case, we have m = n{n — 1), Xi,X2, . . . jXm are the random edge weights, and Z = \CS'P^\. For 
every edge e, we need to compute the second moment of the random variable \CS'P^q\ — \CSV^i\^ where 
CSV^^Q and CSV^ I are the sets of /3-short LSPs when all edges other than e are assigned the same random 
edge weights, while e is assigned two independent edge weights. Due to symmetry, the second moment of 
\CSV°q\ — \CSV%^i\ does not depend on e. For brevity, we write CSVq and CSVi, instead of CSV°q and 
CSVgi, when the e is clear from the context. We similarly define SVq and SVf to be the corresponding 
sets of /3-short shortest paths. 

If A and B are two sets, then ||^| — |i?|| < |A B\, where A (B B = {A B) L) {B \ A) is the symmetric 
difference of the two sets. We thus focus our attention on CSVq © CSVf. We begin by looking at 
SVq © SVf. Let ^^^(e) and SVf{e) be the set of /3-short shortest paths that pass through e with the 
two choices of the weight of e. 

Lemma 5.8 For every edge e we have \SVq @ SVf\ < 2{\SVo{e)\ + |57^f(e)|). 

Proof: Let co(e) and ci(e) be the two costs of e. Suppose at first that co(e) < ci(e). A /3-short shortest 
path that stops being /3-short shortest path when the cost of e is increased from co(e) to ci(e) must pass 
through e. Thus, SV^ \5Pf C SV^{e) and hence \5Pf | < \SV^{e)\. The only paths in SVf^SV^ 
are paths that replace paths from SVq \ SVf. Thus, we also have ISVf \ SVq \ < 157^0(6)1. Under the 
assumption co(e) < ci(e) we thus get \SVq (B SVfl < 2\SVQ{e)\. If co(e) > ci(e), we similarly get that 
\SV^ (BSVfl < 2|5-Pf(e)|. In both cases we have \SV^®SVf\ < 2{\S'P^{e)\ + \SVf{e)\). □ 

We next estimate \SVQ{e)\ and \SVf (e)|. As they both have the same distribution, we omit the subscript. 
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Lemma 5.9 For every ^ > 0, we have F[\S'P^{e)\ > 0] = O(^). 

Proof: Theset5P'^(e) is non-empty only if e is a shortest path between its endpoints, which by Lemma [3.5l 
only happens with probability ^^ + O(^). □ 

Our next goal is to show that |5'P'^(e)| = 0{n^^/^'), with high probability, for any /3' > /3. 

Lemma 5.10 For every /3 > 0, and every /3' > /3, we have F^SV^ {e)\ > n^^^'] = 0{n~'^), for every c > 0. 

Proof: Let e = a — )• 6 be a fixed edge. Let C be the set of pairs {u, v) such that u-^a—^b-^v is a 
shortest path of length at most Clearly |5'P'^(e)| = |C|. For a fixed integer r, and 1 < i < r, let 



Ai = iueV 



„/ ^ + B)\nn , „ , 

D(u,a)<^—-^ ]■ , Bi = -{veV 

r n 



r n 



be the sets of vertices of distances at most ^^^^-^^^ to a and from b, respectively. Note that Bi 
Clearly 



Ball ( b, ^ ) , while Ai = Ball ( a, ^ ^^f^ ) , in the graph in which all edge directions are reversed. 



C C IJ Ai X Br+l-i . 
i=l 

By Corollary ESI we have \Ai\,\Bi\ < for 

every i, with a probability of at least 1 — 0{rn '^), for 
every c > 0. It thus follows that \C\ < rn^-'^"^^)^^"'"^/'')"'"^^, again with this very high probability. Letting r 
sufficiently large and e sufficiently small, we get the claim of the lemma. □ 

Lemmas [53] and ElO] allow us to bound E[|5P^(e)p]. 

Lemma 5.11 For every P > 0, and every (3' > j3 we have E[|5'P'^(e)p] = 0{n'^^^^^">~^ Inn). 

Proof: For succinctness, let X = |5'P'^(e)| and a = n^^^' . We always have X'^ < n^. Using Lemma 15.101 
with c = 4, we have 



E[X2] < P[0 < X < a] • + P[x > a] • = O ■ 71^^'+"'^ + n'^ ■ n^) = O (71^^'+"'^-' In 



n 



a 

We can finally get back to estimating CSVq ® CSVf. Let Aq and Ai be the maximum outdegrees in the 
essential graph, i.e., the subgraph composed of the edges that are shortest paths, under the two independent 
choices of the weight of e. Let A = max{Ao, Ai}. By Lemma |5. II we have P[A > clnn] = ©(n^"*^/^), for 
every c > 6. 

Lemma 5.12 For every (3 > Q we have \CSV^ CSVf\ < 2 A • \SV^ © STf\. 

Proof: Suppose that vr € CSV^ \ CSVf. Then either /[vr] € SV'^ \ SVf or r[7r] € SV'^ \ SVf. 
Each shortest path in SVq has at most Aq pre-extensions and at most Aq post-extensions that are locally 
shortest paths. Thus, ICSV^^CSVfl < 2Ao|5Pf \5Pf |. Similarly, |£5Pf \£5Pf| < 2Ai\SVf \Svf\, 
and the lemma follows. □ 
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Lemma 5.13 For every f3 > 0, every /3' > /3, and every edge e we have 



E 



\csv?\ - ICSV^W^ 



0(^^2(i+/3')-iin3n). 



Proof: By Lemmas 15.81 and 15.121 we have 

< 16A'^{\SVl{e)\ + \SVf{e)\f < 32A2(|5Pf (e)p + |5Pf (e)|2) 
Since both 15^0(6)1, \SVf (e)| < n^, from Lemma 15.11 with c = 24 we have 



E 



£SVf\-\CSV^\l < O (In^ n • E[|57'* (e)n+P[A> 24 In n]-n^) 



= 0{ln^n-E[\SV^{e)\'^]+n-^ ■n'^) . 

The claim now follows from Lemma 15.111 □ 
Using the Efron-Stein inequality (Theorem 15. 7p we thus get: 

Lemma 5.14 For every /3 > and every 13' > 13 we have Var [|£5'P'^|] = O ^n^(^+^')+^ In^ n 

Theorem 5.15 There is a constant c such that P[[£5'P[ > cn^] = 0(n^^/^^). 
Proof: Let /3 = By Lemma I^TUl we get that 

¥[\jZSV^\ > n^] = 0{n-l^/^^lnn) = 0{n-^/^^ Inn) . (7) 
Let 13' = (3 + e, where e > is tiny. By Lemma 15.141 we get that 

Var[|/:5P^|] = 0(n2(i+'5')+iln3n) = 0{n^'>/^'>+'nn' n) = ©(n^^/^s+s^). 



By TheoremSSJwe have E[\CSV\] = e{n^). By Lemma[531 we have EWCSV^l = o{n^). As E[\CSV\] = 
E[\CSV^\]+E[\CSV^\], we get that E[\CSV^\] = e{n^). 

By Chebyshev's inequality (see, e.g., [26j), for every random variable X we have 

F[X>2E[X]] < F[\X-E[X]\>E[X]] < . 

For X = and using the facts that E [\CSV^\\ = e{n^) and Yar[\CSV^\] = ©(n^s/^s+Se)^ ^hus 

get 

¥\\CSr'\>2E[\CSV''\]] < [l^-^^^^l] ^ 0(^-1/25+3.) _ (g) 

E[\CSV^\] 

As \CSV\ = ICSV'^l + {CSV^l, combining (7) and (8) and choosing e small enough, we get the claim of 
the Theorem. □ 

We believe that for every a > there exists c such that P [1/^57^1 > cn^] = 0(n~°). Proving, or disproving, 
this claim would require new techniques. 
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6 An 0(n^)-time implementation 



In this section we describe an implementation of the algorithm of Section [2.11 (and Appendix that runs 
in O(n^) time in expectation and with high probability. This is done using a simple observation of Dinic 
[12] and a simple bucket-based priority queue implementation that goes back to Dial [TO] . 
Let 5 = min(„ c(n, v) be the minimal edge weight in the graph. We claim that algorithm apsp of 
Section [2.1l remains correct if instead of requiring that the pair {u,v) extracted from the heap Q is a pair 
with minimal dist{u,v), we only require that dist{u,v) < dist{u' ,v') + 5 for every other pair {u',v') in Q. 
The proof is a simple modification of the proof of Theorem 12.21 given in Appendix El This observation, in 
the context of Dijkstra's algorithm, dates back to Dinic [12j. Along with many more ideas, this observation 
forms the basis for the linear worst-case time single-source shortest paths algorithm for undirected graphs 
obtained by Thorup |32j . It is also used by Hagerup [18] to obtain a simple linear expected time algorithm 
for single source shortest paths, simplifying results of Meyer [25| and Goldberg [17] . 

In our setting, edge weights are drawn independently and uniformly at random from [0, 1] . The probability 
that the minimal edge weight is smaller than n~'^'^ is clearly at most n~^'^. If this unlikely event happens, 
we simply use an O(n'^logn) time implementation based on Fibonacci heaps. This only contributes o(n'^) 
to the expected running time of the algorithm. 

We assume now that 6 > For every u,v £V, we let dist'{u,v) = [dist{u,v)/S\ and use dist'{u,v), 

instead of dist{u,v), as the key of {u,v) in Q. 

We implement the heap Q as follows. (There are many possible variants. We describe the one that seems 
to be the most natural.) We use L = in? buckets i?i,i?2, • • • ,Bl. Bucket Bi, for i < L, is a linked list 
holding pairs {u,v) for which dist'{u,v) = i. Bucket Bl is a special leftover bucket that holds all pairs 
{u,v) for which dist'{u,v) > L. It is again implemented as a linked list. We also maintain the index k of 
the bucket from which the last minimal pair was extracted. 

The implementation of a heap-insert operation is trivial. To insert a pair {u,v) into Q, we simply add 
{u,v) to Bi, where i = m.m{dist'{u,v), L}. 

A decrease-key operation is also simple. We simply remove (u, v) from its current bucket and move it to 
the appropriate bucket. (Each pair has a pointer to its position in its current bucket, so these operations 
take constant time.) 

An extract-min operation is implemented as follows. We sequentially scan the buckets, starting from B/^, 
until we find the first non-empty bucket. If the index of this bucket is less than L, we return an arbitrary 
element from this bucket and update k if necessary. If the first non-empty bucket is Bi, the leftover bucket, 
we insert all the elements currently in Bl into a comparison-based heap and use it to process all subsequent 
heap operations. (We show below that in our setting, we would very rarely encounter this case.) 

This implementation of the extract-min operation is correct as the priority queue that we need to maintain 
is monotone, in the sense that the minimal key contained in the priority queue never decreases. This follows 
immediately from then fact that keys of new pairs inserted into Q, or decreased keys of existing pairs in Q 
are always larger than the key of the last extracted pair. 

The total time spent on implementing all heap operations, until all buckets Bi,...,Bi-i are empty, 
is clearly 0{N + L), where N is the number of heap operations performed. By Theorem 12.21 we have 
= 0{\CSV\ +n^). By Theorem 14.31 we have E[[/35'P|] = O(n^). By Theorem I5.15| there is constant c 
such that P[|/^5'P| > cn^] = 0{n^^^^'^). As L = n^, the number of operations here is O(n^), both in 
expectation and with high probability. 

All that remains, therefore, is to show that the probability that Bl will be the only non-empty bucket 
is tiny. Note that this happens if and only if there is a pair u,v G V for which D{u,v) > LS > n~^'^. 
By Lemma 13.4^ this probability is 0{n~^) for every c > 0. If this extremely unlikely event happens, the 
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running time is only increased to 0{n? log n), which has a neghgible effect on the expected running time 
of the whole algorithm. We have thus obtained: 

Theorem 6.1 The expected running time of algorithm apsp, when implemented using a bucket-based pri- 
ority queue, and when run on a complete directed graph with edge weights selected uniformly at random 
from [0, 1] is O(n^). Furthermore, there is a constant c > such that the probability that the running time 
of the algorithm exceeds cv? is 

7 Polylogarithmic update times 

In this section we consider the expected time needed to update all shortest paths following a random edge 
update, i.e., an update operation that chooses a random edge e of the complete directed graph, uniformly at 
random, and assigns it a new random weight, independent of all previous weights chosen, drawn uniformly 
at random from [0, 1]. 

Recall that SV^ and CSV~ are the sets of shortest and locally shortest paths destroyed by an update 
operation, and that SV^ and CSV^ are the sets of shortest and locally shortest paths that are created 
(or recreated) by an update operation. More specifically, we have 

SV- = SVo{e) U {SVo \ SVi) , 

SV+ = SViie) U (SVi^SVo) , 

CSV- = CSVoie) U {CSVo ^ CSVi) , 

CSV+ = CSViie) U {CSVi \ CSVo) , 

where, as in Section [5l SVq and SVi are the sets of shortest paths before and after the update of e, and 
'5'Po(e) and SVi{e) are the sets of shortest paths, before and after the update, that pass through e. The 
sets CSVq, CSV I, CSVo{e) and CSVi{e), are the corresponding sets of locally shortest paths. 

Our main goal is to bound the expected sizes of the sets SV~ , SV~^, CSV~ and CSV^ . This, in conjunction 
with Theorem 12.31 would supply an upper bound on the expected update time. By symmetry, it is easy 
to see that E[|5P"|] = E[\SV^\] and E[\CSV~\] = E[\CSV^\]. We can thus concentrate on estimating 
E[|5P-|] andE[\CSV-\]. 

Let e be the random edge updated by a random edge update operation. For every u,v G V, let 7ro[?x, 
and 7ri[it,'u] be the shortest path from u to v before and after the update. Let Bi = {{u,v) \ e £ TTi[u,v]}, 
for i G {0, 1}, be the set of pairs of vertices connected, before and after the update, by a shortest path 
passing through e. (Note that \Bi\ = \SVi{e)\, for i E {0,1}.) It is easy to see that 7ro[n, t>] G SV^ 
if and only if e G ttqIujV] or e £ tti[u,v]. Thus, SV~ = {ttq[u,v] \ {u,v) £ Bq U Bi} and similarly 
SV^ = {-Ki[u,v] I (n, w) G Bq U Bi}. In particular = \SV^\. More importantly, 

\SV-\ < \SVo{e)\ + \SVi{e)\ . 

To bound Ed^T^"!] = E[\SV^\] it is thus enough to bound E[|57'o(e)|] = E[|57'i(e)|]. 

Lemma 7.1 The expected number of edges on a shortest path between two random vertices is (H-o(l)) In n. 

Proof: When edge weights are exponential, the expected number of edges on a shortest path between two 
random vertices is exactly equal to the average depth of a vertex in a random recursive tree of size n. (See, 
e.g., Janson [20].) It is known that this average depth is (1 + o(l)) Inn (Moon [28]). The same asymptotic 
result holds also under the uniform distribution. (See Section 2 of Janson [20].) □ 

Lemma 7.2 The expected number of shortest paths that pass through a random edge e is {1 + o(l)) Inn. 
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Proof: For every u,v €V, let tt[u,v] be the shortest path from u to v, and let |7r[n, f]| be the number 
of edges on it. For every edge e of the complete graph, let SV{e) be the set of shortest paths that pass 
through e. By symmetry we have 



W.\\5V{e)\] = E 



nin — 1) 

By Lemma [731 we get that E[|57^(e)|] = (1 + o(l)) Inn. 



E[|7r[n, v\ 



□ 



Theorem 7.3 Following a random edge update, we have E,[\SV \] = E[|iS7'^|] < (2 + o(l)) Inn. 

Let A be the maximal degree of the essential graph G* = {V, E*) defined in the previous section. Lemma [5. II 
says that with high probability A = O(logn). 

Theorem 7.4 Following a random edge update we have £[[£57^^!] = E[|>CiSP^|] = O(log^n). 

Proof: Clearly vr G CSV^ if and only if /[vr] G SV^ or r[7r] G SV^ . Each shortest path has at most 2 A 
LSP extensions. Thus \CSV'\ < 2A • \SV'\. By Lemma [5lll we have P[A > 24 Inn] = 0{n-^). As \CSV\ 
is always at most n^, we get ¥.[\CSV~\] < 481nn • E[!57'~|] + n'^ ■ = O(log^n). □ 

We believe that the O(log'^n) bound in Theorem 17.41 can be improved, possibly to O(logn), and leave it 
as an open problem. 

Theorem 7.5 The expected running time of a random edge update, when a Fibonacci heap is used to 
implement the global heap, and simple linked lists are used to implement the local heaps, is O(log^n). 



8 Concluding remarks 

We presented an algorithm that solves the APSP problem on complete directed graphs with random edges 
weights in O(n^) time with high probability. The expected running time of the algorithm is also O(n^). 
This solves an open problem of Frieze and McDiarmid [15] . 

We also presented a dynamic algorithm that performs random edge updates in O(log^n) expected time. 
It is an interesting open problem whether this can be improved to O(logn). 

Our results also hold in the directed G{n,p) model in which each edge is selected with probability p, where 
p ^ (In n)/n. Selected edges are again assigned independent, uniformly distributed, weights. Similarly, it 
is easy to see that our results apply when edge weights are integers chosen uniformly at random from, say, 
{1, 2, . . . , n}, where n is the number of vertices. 
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A The static algorithm — complete description 

In this section we give a full description, and a correctness proof, of the static version of the Demetrescu 
and Italiano [7, 8] used in this paper. Pseudo-code of the algorithm, called apsp, is given in Figured] The 
input to the algorithm is a weighted directed graph G = {V,E,c), where c : E ^ (0, oo) assigns positive 
weights (or costs) to the edges of the graph. The algorithm in Figure [1] works correctly only under the 
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assumption that all shortest paths are unique. Under essentially all probabilistic models considered in 
this paper, this assumption holds with probability 1. Algorithm apsp is also interesting, however, in non- 
probabilistic settings. For a simple way of dispensing with the uniqueness assumption, without increasing 
the running time of the algorithm by more than a constant factor, see Demetrescu and Italiano [7]. 

We next prove Theorem 12.21 of Section [2l which we repeat for the convenience of the reader. 

Theorem 12.21 // all edge weights are positive and all shortest paths are unique, then algorithm apsp 
correctly finds all the shortest paths in the graph. Algorithm apsp runs in 0{n^ ■ {Tins{n?) + T^xt{n'^)) + 
\CSV\ ■ Tdecin^)) time, where \CSV\ is the number of LSPs in the graph, and uses only 0{n'^) space. 

Proof: It is easy to check that each stage during the operation of the algorithm, dist[u, v] corresponds to 
some path from it to ?; in the graph and that this path, or an even shorter path, can be traced using the p 
and q fields. Thus, the distances returned by the algorithm can never be too small. 

It is also easy to check that the keys of the pairs (n, v) extracted from Q form a non-decreasing sequence 
and that a pair (u, v) removed from Q is never inserted to Q again. Thus, the algorithm always terminates. 

Assume, for the sake of contradiction, that the algorithm fails to find a shortest path u v, for some 
u,v (zV. Let u V he a shortest shortest-path not found by the algorithm. (In other words, if u' v' is 
shorter than u v, then u' v' is found by the algorithm.) 

If n ~^ v is simply the edge u ^ v, then we immediately get a contradiction, as the algorithm starts by 
setting dist[u,v] to c{u,v) (and p[u,v] to v, and q[u,v] to u), for every {u,v) E E. Thus, the algorithm 
does find the shortest path U'^v = u^v,a contradiction. 

Assume, therefore, that u~-^v = u^u'~^v'^vis composed of at least two edges. (If it is composed of 
exactly two edges, then u' = v' .) Clearly u ^ u' v' and u' v' ^ v are also shortest paths and their 



Function apsp(G = {V,E,c)) 

init(G) 
Q ^ heapO 

foreach {u, v) G E do 

dist[u, v] c{u, v) 
p[u, v] V 
q[u, v] u 

heap-insert((5, {u, v),dist[u, v]) 

while Q / do 

{u,v) ^ extract-min((5) 
insert(L[p[ti, v],v],u) 
insert(i?[n, q[u, v]],v) 
foreach w € -L[u, g[M, ■;;]] do 
|_ exajaine(w,u,v) 

foreach w G -R[p[n, f], do 
|_ exaj!iine(u,v,w) 



Function init(G = {V,E,c)) 

foreach u,v G V do 

dist[u, oo 
p[u, v] null 
q[u, v] ^ null 
L[u,v] ^ 
_ R[u, ^ 

foreach u G V do 

|_ dist[u, ti] ^ 



Function examine iu,v,w) 

if dist[u,v] + dist[v,w] < dist[u,w] then 
dist[u, w] dist[u, v\ + dist[v, w\ 
if p[u, w] = null then 

I heap-insert((5, (u, ), (iist[n, w]) 
else 

|_ decrease-key(Q, (u, w), dist[u, w]) 

p[u, W] -It- p[u, v] 

q[u, w] q[v, w] 



Figure 1: A static version of the APSP algorithm of Demetrescu and Italiano [7, 8j. 
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Function path(e = {u,v)} 



Function pathCvri, 7r2) 



Function pathC?;) 



return vr 




vr ^ new-path() 
/[vr] ^ p[u] 
r[7r] ^ 
start [vr] ^ u 
end[7r] ^ v 
/irst[vr] e 
Zast[vr] e 
cost[vr] c[e] 
sj)[vr] ^ false 
L[vr],i?[vr] ^0 
SL[tt],SR[tt] ^ 
insert(L[p[f]], vr) 
iiisert(i2[p[u]], vr) 
return vr 



if r[7ri] ^ Z[vr2] then error 
vr ^ new-path() 

/[vr] ^ vri 
r[vr] ^ vr2 

start [vr] ^ start [vri] 

en(i[vr] <— end[vr2] 

/irst[vr] /irst[vri] 

/ast[vr] ^ Zast [vr2] 

cost[vr] ^ c[/irst[vr]] + cost[vr2] 

sp[vr] ^ false 

L[vr],i?[vr] ^0 

SL[Tr],SR[Tr] ^ ^ 

insert(L[vr2], vr) 

insert(i?[vri], vr) 

return vr 



Figure 2: Generating new paths and inserting it into the path system. 



length is strictly smaller than the length oiu-^ v, as c{u,u'),c{v' ,v) > 0. Thus, by the our assumptions, 
u ^ u' v' and u' v' ^ v are discovered by the algorithm. When the second of these is discovered, 
the algorithm examines the path u~^v = u^u'-^v'^v and sets dist[u, v] to its length. Also (u, v) is 
added to Q if it is not already there. As there is no shorter path from u to u in the graph, the values of 
dist[u, v], p[u, v] and q[u, v] would never be changed again, contradicting the assumption that the algorithm 
does not find the shortest path from u to v. 

We next analyze the running time of algorithm. Each pair (n, v) is inserted and extracted from the priority 
queue Q at most once. The total cost of these operations is 0{n'^{Tins{n'^)+Texti^'^)))- All paths considered 
by the algorithm are LSPs. The algorithm examines each LSP exactly once. For each LSP it performs a 
constant number of operations followed perhaps by a decrease-key operation. The total cost of all these 
operations is 0{\CSV \ Tdecin'^))- The complexity of all other operations is negligible. 
Finally, to see that the algorithm uses only O(n^) space, note that the removal of a pair {u,v) from the 
heap Q causes the insertion of only two elements to lists L[u',v'] and R[u',v']. As each pair {u,v) is 
extracted at most once, the total size of all these lists is O(n^). □ 

B The dynamic algorithm — complete description 

As explained, one of the main differences between the static and dynamic algorithms is that the dynamic 
algorithm explicitly maintains all LSPs in a path system, and does not just examine them. Paths are 
created by the three constructors path(w), path(e) and path(vri, vr2) given in Figure [21 path(u) generates 
a path of length containing the vertex v. path(e) generates a path composed of the edge e. path(vri, vr2) 
takes two paths vri and vr2 such that r[vri] = /[vr2] and constructs a path vr such that /[vr] = vri and r[vr] = vr2. 
The new path vr is composed of the first edge of vri followed by vr2, or equivalently, by vri followed by the 
last edge of vr2. 

Every path vr has the following fields: 

/[vr] - A pointer to the path obtained by removing the last edge of vr. 
r[vr] - A pointer to the path obtained by removing the first edge of vr. 
start[ir] - The first vertex on vr. 
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end[7r] - The last vertex on tt. 

first[7r] - The first edge on vr. 

last[Tr] - The last edge on vr. 

cost[7r] - The total cost (weighted length) of vr. 

sp[vr] - true if and only if vr is known to be a shortest path. 

L[vr] - List of left LSP extensions of vr. 

i?[vr] - List of right LSP extensions of vr. 

5'L[vr] - List of left shortest path extensions of vr. 

^/^[vr] - List of right shortest path extensions of vr. 

The lists 5'L[vr] and ^-^[vr] are similar to the lists and R[u,v] used by the static algorithm. This 

time, however, they contain actual paths and not vertices. The lists L[vr] and -R[vr] contain all LSPs, already 
constructed, obtained by extending vr by one edge at its beginning or end, respectively. 
The initialization function of the dynamic version, called dapsp-init, is given in Figure [3l It is similar 
to the static apsp algorithm. It too uses a global heap Q that stores pairs of vertices for which shortest 
paths are sought. For every v G V, we let p[v] be the empty path consisting of v. For every edge e € E, we 
let p[e] denote the path consisting of e. For every two vertices u,v (zV, the dynamic algorithm maintains 
the following information: 

vr[M, v] - The shortest path from u to v found so far. 

cost[u, v] - The cost of the shortest path from n to f found so far. 

P[u, v] - a heap containing all the LSPs from n to v found so far. 

We refer to P[u,v] as the local heap corresponding to the pair {u,v). We refer to Q as the global heap. 

The initialization function dapsp-init starts with some obvious initializations. (For every u,v G V, it 
sets vr[u, v] to null, sets dist[u, to cx), sets P[u, v] to an empty heap, etc.) For every e G E it then creates 
the path p[e], by calling path(e), and then examines it by calling examine(p[e]), given in Figure [H 

The function examine(vr) receives a newly created LSP connecting two vertices u = start[7r] and v = en(i[vr]. 
It starts by inserting it into the heap P[u, v] with key co,st[vr]. It then checks whether vr is the first available 
LSP from u to v, or whether it is shorter than all existing LSPs between u and v. If vr is shorter than vr[M, v], 
the shortest available path from u to f , then vr[?i, f ] is clearly not a shortest path. The algorithm thus sets 
sp[vr[ii, v]] to false. It then removes all extensions of vr[u, f] from the system, if there are any. This is 
done by a call to reniove-exts(vr[ii, v], false) which we discuss later. Finally, if vr is currently the shortest 
available path from u to v, examine updates vr[M, f] and dist[u,v] accordingly. It also inserts {u,v) into the 
global heap, if it is not already there, or decreases its key to cost[vr]. (We assume that heap-insert does 
exactly that, i.e., inserts an item into a heap with a given key, or decreases its key, if the item is already 
in the heap.) 

dapsp-init then calls build-paths which is also given in Figure [31 build-paths repeatedly removes a 
pair (u, v) with the smallest key from the global heap Q. The corresponding path 7r[u, v] is then a shortest 
path. The call new-shortest-path(vr) is then made. 

The function new-shortest-path(vr) receives a newly discovered shortest path. It sets to ^^[vr] to true. It 
inserts vr to the lists 5-L[r[vr]] and S'i?[r[vr]], as vr is now a shortest path left extension of r[vr] and a shortest 
path right extension of /[vr]. (Note that vr is already contained in i[r[vr]] and -R[/[vr]] at this stage.) Most 
importantly, new-shortest-path(vr) now constructs LSPs extensions of vr and examines each one of them. 
(These operations may add new pairs into the global heap Q.) 

Using essentially the same arguments used to prove Theorem 12.21 we get that dapsp-init correctly finds 
all shortest and locally shortest paths in the graph. 
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Function update (£", c') 



Function dapsp-init (G = {V,E,c)) 




Q <— heapO 



for each u,v & V do 

Tr[u, v] null 
dist[u, w] oo 
P[u, v] ^ heapO 



foreach u € F do 

p[u] -It- path(n) 
Tr[u, u] p[u] 
dist[u, li] 



foreach e ^ E' do 



p[e] path(e) 




examine(p[e]) 



foreach e € do 

p[e] ^ path(e) 
exainine(p[e]) 



build-paths() 



Function build-paths () 



build-paths() 



while Q / do 

{u,v) extract-min((5) 
new-shortest-path(7r[u, v]) 



Figure 3: Initiating and updating the dynamic all-pairs shortest paths data structure. 



Updates are performed by calling update, also given in Figure [3l update(£", c') assigns the edges of E' 
new edges weights and recomputes all shortest paths. update(i?', c') starts by removing all paths that 
pass through edges of E'. This done by calling remove-path(p[e], true), for every e S E'. (Function 
remove-path is discussed below.) These removals create a list A of pairs (n, v) that lost their shortest 
path. For every {u,v) (z A, a call is made to replace-path(ti, u). Paths corresponding to all edges of E' 
are recreated, with their new costs, and these edge paths are examined. All updated shortest paths are 
then obtained by a call to build-paths. 

replace-path(u, u), given in Figure [5l receives a pair of vertices {u,v) such that the shortest path from u 
to V has just been destroyed. It finds the shortest path vr in P[u,v], if there is one, and performs the 
necessary updates. (Note that n is not necessarily the shortest path from u to v. It is just the shortest 
path currently available.) 

Finally, paths and their extensions are removed from the path system by the functions remove-path and 
remove-exts also given in FigureO To remove a path vr from the path system, remove-path(7r, rep) deletes 
vr from P[u,v], where u = start[7r] and v = end[7r] are the endpoints of vr. It also deletes vr from ii[/[vr]] 
and L[r[vr]]. If vr is marked as a shortest path, i.e., spin] = true, then vr is also removed from 5'i?[Z[vr]] and 
S'-L[r[vr]]. Finally, if sp[vr] = true and rep = true, then {u,v) is inserted into a list A of pairs who lost 
their shortest paths. remove-exts(vr, rep) removes all the extensions of vr from the path system, by calling 
remove-path(vr'), for every vr' € L[vr] PI i?[vr]. 

Theorem 12.31 now follows by examining the operation of the algorithm. When a shortest path is destroyed it 
is removed from its local heap. In some cases, the shortest path in the local heap is found and a pair (n, v) 
is inserted into the global heap. The total cost of these operations is at most Tdei{^) + Tmin{^) + Tins{n'^), 
where A is an upper bound on the size of the local heaps. Each new shortest path is extracted from 
the global heap at a total cost of Text(ra^). Each LSP destroyed is removed from its local heap at a cost 
of Tfieii^)- Finally, each LSP created is inserted into the appropriate local heap and possibly causes a 
decrease-key operation on the global heap, a total cost of Tj„s(A) -|- Tdecin'^)- 
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Function new-shortest-path(7r) 



Function examine (tt) 



sp[Tr] <— true 

insert(S'L[r[7r]], tt) 
insert(S'i?[Z[7r]],7r) 

foreach vr' G SL[l[Tr]] do 
tt" ^ path(7r',7r) 
examine(7r") 

foreach tt' € S'i2[r[7r]] do 
tt" ^ path(7r, tt') 
exaiiiine(7r") 



Figure 4: The functions 



u <— start[ir] ; v <— end[7r] 
heap-insert(P[u, v],7t, costlir]) 

if cost[7r] < dist[u, v] then 

if 7r[n, f] 7^ null then 
sp[7r[u, w]] ^ false 
remove-exts(7r[tt, u], false) 

Tr[u, v] TT 

dist[u,v] cost[7r] 
heap-insert((5, {u,v),cost[TT]) 



r-shortest-path and examine. 



Function remove-path (tt, rep) 

u ^ start [tt] ; v ^ end[TT] 
heap-delete(P[ti, v],tt) 

delete(i?[/[7r]],7r) 
delete(L[r[7r]], tt) 

if sp[tt] = true then 
if rep = true then 
|_ insert(^, {u, v)) 

delete{SR[l[TT]],TT) 
_ delete(S'L[r[7r]],7r) 

remove-exts(7r, rep) 



Function remove-exts(7r, rep) 

foreach tt' G L[7r] U ii![7r] do 
|_ remove-path(7r', rep) 



Function replace-path(ii, 

if P[u,v] / then 

TT f ind-min(P[?x, v]) 

tt[u, v] <— tt 

dist[u, v] ^ COSt[TT] 

heap-insert((5, {u, v), cost[TT]) 
else 

tt[u, v] <— null 

disf[ii. v] ^ DC 



Figure 5: Removing a path and its extensions from the path system. 
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